Agents and Tools

Demo

This code demo is a tiny UTD course-planning assistant. It shows the core agent idea without a giant framework: an LLM can ask Python to run tools, Python runs them, and the results go back to the LLM.

The fun twist: we break the work into one extra layer. The top-level agent does not do every task itself. It calls smaller specialist subagents.

Start It Up

From the project root:

python -m venv .venv
.venv\Scripts\Activate.ps1
pip install -r requirements.txt

Create a .env file in the project root:

GOOGLE_API_KEY= YOUR_GOOGLE_AI_STUDIO_KEY
GEMINI_MODEL= gemini-3.1-flash-lite

Run the app:

uvicorn backend.main:app --reload

Then open:

http://localhost:8000

FastAPI also gives you API docs at:

http://localhost:8000/docs

The Architecture

User
-> FastAPI /chat
-> orchestrator_agent
   |-- EligibilityAgent
   |   |-- GetStudentHistory
   |   |-- GetCourseCatalog
   |   `-- GetCourseInfo
   `-- ReviewsAgent
       `-- GetRMPScore
-> final answer

The big idea is simple: multi-agent systems break a bigger task into smaller parts. The orchestrator coordinates. The subagents focus. The tools fetch real data.

In this project:

backend/main.py exposes the /chat endpoint.
backend/orchestrate.py runs the top-level orchestrator agent.
backend/subagents.py runs the specialist subagents.
backend/data_tools.py contains the actual Python tools.
backend/data/*.json is the mock database.
frontend/ is the small browser UI.

The Agent Loop

The loop is the magic trick, except the rabbit is just JSON.

Send the user question to Gemini.
Gemini either answers or asks to call a tool.
Python runs the requested tool.
Python sends the tool result back to Gemini.
Gemini uses that result to answer or ask for another tool.

The orchestrator loop lives in backend/orchestrate.py:

response = client.models.generate_content(
    model=model_used,
    contents=contents,
    config=config,
)

function_calls = getattr(response, "function_calls", None) or []
if not function_calls:
    return response_text(response), used_tools, model_used

If Gemini asks for tools, we route those requests to Python:

tool_result = call_orchestrator_tool(call.name, args, student_id, trace)
tool_results.append(
    types.Part.from_function_response(name=call.name, response=tool_result)
)
contents.append(types.Content(role="tool", parts=tool_results))

That last line is important: it gives Gemini the tool results so it can keep reasoning with fresh information.

Tools vs Subagents

A normal tool is a Python function. Example from backend/data_tools.py:

def get_student_history(studentID: str) -> dict[str, Any]:
    students = load_json("students.json")
    student = students.get(studentID)

    if student is None:
        return {"status": "not_found", "studentID": studentID}

    return {"status": "success", "studentID": studentID, **student}

A subagent is also exposed like a tool, but instead of directly reading JSON, it runs its own smaller agent loop.

The orchestrator only sees these two "tools":

ORCHESTRATOR_TOOL_SCHEMAS = [
    {"name": "EligibilityAgent", ...},
    {"name": "ReviewsAgent", ...},
]

Then the Python router calls the right subagent:

def call_orchestrator_tool(name, args, student_id, trace):
    if name == "EligibilityAgent":
        return run_eligibility_agent(args["question"], student_id, trace)
    if name == "ReviewsAgent":
        return run_reviews_agent(args["question"], trace)

So the orchestrator delegates:

Eligibility questions go to EligibilityAgent.
Professor review questions go to ReviewsAgent.

The Subagents

The eligibility subagent has course-planning tools:

GetStudentHistory
GetCourseCatalog
GetCourseInfo

The reviews subagent has review data:

GetRMPScore

Both subagents use the same pattern as the orchestrator:

config = types.GenerateContentConfig(
    system_instruction=instruction,
    tools=[types.Tool(function_declarations=tool_schemas)],
)

That means each agent gets:

instructions for how to behave
a list of tools it is allowed to call
a conversation history called contents

Trace

The frontend shows a trace after each response. This is there so you can see the agent thinking path without guessing.

You will see events like:

orchestrator - llm_request
orchestrator - tool_call_started
eligibility_agent - tool_calls_requested
eligibility_agent - tool_call_finished
reviews_agent - final_answer
orchestrator - final_answer

If something goes wrong, the trace tells you where:

Did the orchestrator call the wrong subagent?
Did a subagent call the wrong tool?
Did a tool return not_found?
Did Gemini return an API error?

Tiny debugger, big usefulness.

How This Connects To MCP

Right now the tools read local JSON files. That keeps the workshop easy to run.

But the shape is already MCP-friendly:

GetRMPScore today -> reads backend/data/rmp_reviews.json
GetRMPScore later -> calls a Brave Search MCP server

The agent does not need to know whether the tool reads JSON, calls an API, or uses MCP. It just asks for GetRMPScore, and Python handles the implementation.

Summary

What we created is an LLM orchestrator that can call specialized subagents, and those subagents can call Python functions as tools. By breaking the task into smaller parts, each subagent handles one focused job and passes its result back to the orchestrator. The orchestrator then combines those results to reason across multiple steps and produce the final answer. This lets the system gather information it knows how to access through tools instead of relying only on the model's memory.

Demo

Start It Up​

The Architecture​

The Agent Loop​

Tools vs Subagents​

The Subagents​

Trace​

How This Connects To MCP​

Summary​